Cultural Capitals - Read Me

[because the original works are under copyright, we cannot freely share them. However we provide full tables of all place mentions matched on our data and used in our analysis as well as sample passages for review in addition to our code]


File Description

1. Country Designations used for matching
min_countries_allNames_simplified.csv
min_countries_allNames.csv

2. List of our data and country codes
min_countries_ourData.csv

3. Figures
min_Fig1_NationalPrevalence.pdfmin_Fig2_AllGeoPrevalence.pdfmin_Fig3_NationalismQuotient.pdfmin_Fig4_RatioRidges.pdfmin_Fig5_recallByCorpus.pdf

4. Custom lists of place names manually validated
min_geo_custom1.csvmin_geo_custom2.csv

5. Description of variables of our measures by work for our final output table
min_geo_finalTable_ReadMe.txt

6. Final outputs of our measures by work
min_geo_finalTable.csv

7. All geotagged locations in our data
min_geoTags_all.csv

8. Annotated geotagged locations in our data [i.e. resolved to a country]min_geoTags_Annotated_All.csv

9. All passages validated manually by student coders and reviewed by PIs
[this can be a useful snapshot of the underlying data used in the project]
min_Validation_All.csv

10. All place names validated manually, including true positives, false positives, and false negatives [subset of 9]
min_Validation_ErrorTable.csv

11. Output of the true prevalence estimation using method by Messam et al.
min_Validation_TruePrevalenceTable_AllGeo.csvmin_Validation_TruePrevalenceTable_National.csv

12. Wiki "country of" pages
min_Wiki_HTML

13. Code
minorLit.R

14. Metadata on our corpus
[this includes a column for the "National_Names" used to match national self-references described in our paper]
min_meta_master.csv


